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Abstract 

Online social media have greatly affected the way in which we communicate with each 
other. However, little is known about what are the fundamental mechanisms driving dynami¬ 
cal information flow in online social systems. Here, we introduce a generative model for online 
sharing behavior that is analytically tractable and which can reproduce several characteris¬ 
tics of empirical micro-blogging data on hashtag usage, such as (time-dependent) heavy-tailed 
distributions of meme popularity. The presented framework constitutes a null model for so¬ 
cial spreading phenomena which, in contrast to purely empirical studies or simulation-based 
models, clearly distinguishes the roles of two distinct factors affecting meme popularity: the 
memory time of users and the connectivity structure of the social network. 


1 Introduction 

Recent advances in communication technologies and the emergence of social media have made it 
possible to communicate rapidly on a global scale. However, since we receive pieces of information 
from multiple sources, this has also made the information ecosystem highly competitive: in fact, 
users’ influence and visibility are highly heterogeneous and topics strive for users’ attention in 
online social systems. Although several studies have described the dynamics of information flow 
in popular communication media the main factors determining the observed patterns have 

not been identified and there is no theoretical framework that addresses this challenge. Indeed, 
given the potential for applications—e.g., having more efficient systems to spread information 
for safety and preparedness in the face of threats—a better understanding of how memes (ideas, 
hashtags, etc.) emerge and compete in online social networks is critical. 

Information often spreads through a social network as a cascade: a person adopts a new be¬ 
havior or installs a new app, or sends a news item or rumour to their friends (e.g., by tweeting it 
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on Twitter). The avalanche spreads if the friends decide to also adopt the new behavior, and in 
turn pass on the social influence effect to their own friends, who may further propagate the be¬ 
havior. Following the usage in the review [6], we apply the term “social spreading phenomena” to 
describe such cascading or “viral” propagation [7] . The latter term is used because the description 
of information spreading bears some similarity to epidemics of contagious disease; the effects of 
network structure on disease contagion have been well-studied by physicists [8] , see [9] for a recent 
review. However, unlike epidemics of a single disease strain, we focus on social spreading phenom¬ 
ena that occur in the presence of competition between a large number of different items of similar 
type. Examples of the types of items include URLs on Twitter mm, apps on Facebook mm, 
or videos on YouTube m- In each of these examples, users make choices—often influenced by 
the choices they have seen their friends make—and the accumulation of many choices leads to a 
distribution of popularity of the items: some items become extremely popular, while other items 
remain obscure. 

To enable a succinct general description, we will call such items by Dawkins’ term [12] “memes” 
because they are all “elements of a culture or system of behavior passed from one individual to 
another by imitation... ” m- Note that we do not restrict our study only to very popular memes; 
indeed our interest is in understanding the entire popularity distributions of memes, from the 
unpopular to the very popular. This dehnition of a meme has also been used by researchers 
studying cascades on Facebook m, the spreading of news through blogs [T5] , and the popularity 
of hashtags on Twitter mm, but it can also be applied to analyze popularity distributions 
of offline items (where copying promotes spreading) such as baby names [T7j, dog breeds [T8] . 
and even to citations (which are a type of popularity measure) of scientific papers [I9l|20]. The 
memes in these examples are all relatively simple units of information that are easily identified 
in data sets; recent work has also demonstrated that more complex memes (represented by the 
appearance of common phrases, such as “quantum” or “graphene”, in the scientific literature) 
can be recognized by their inheritance patterns in the citation network m- 

A notable characteristic of many meme popularity distributions is that they are very fat¬ 
tailed: if a power-law distribution is fitted to the data then the power-law exponent r is typically 
between 1.5 and 2, which lies outside the range of exponents produced by models of cumulative- 
advantage |22H24j or preferential-attachment [25] type. The statistical physics of avalanches has 
been studied in the context of condensed-matter systems, where the flip of a single magnetic spin 
domain can cause its neighboring domains to also flip and so initiate a cascade [2^. If the physical 
parameters of such a system are tuned to place it at a critical point m the sizes of avalanches are 
power-law distributed; the sandpile model of self-organized criticality (SOC) self-tunes so that the 
system balances at the critical point [28]. However, unlike the memoryless particles or magnetic 
spins that constitute the microscopic entities in condensed-matter avalanches, humans absorb and 
transmit information on a wide variety of timescales that range from seconds to weeks [291130]. 
Models of social interaction must therefore include “memory” effects (non-Markovian aspects) 
that lead to the emergence of characteristics that are qualitatively different from those seen in 
condensed-matter avalanches. The non-Markovian aspects of human temporal behavior have 
attracted considerable recent attention (e.g. [3IH35]), but we wish to investigate the effects of 
memory on popularity avalanches caused by users choosing between multiple items that they 
have seen in the past. 

To address this problem, we develop a theoretical framework that models how users choose 
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among multiple sources of incoming information and affect the spreading of memes on a directed 
social network, like Twitter m- Our probabilistic model, in contrast to other studies [3l l5lll9p36j 
that use intensive computational simulations to fit to data, allows us to get analytical insights into 
the respective roles of the network degree distribution, the memory-time distribution of users, and 
the competition between memes for the limited resource of user attention. The model is a “null 
model” in the sense that it is analytically tractable, yet realistic enough to be fitted to empirical 
data and to reproduce some important characteristics of the data. We show that fitting to time- 
dependent data requires a non-trivial memory-time distribution, which is not possible with the 
toy model of Ref. m, where users can remember only one meme. However, the phenomenon 
of “competition-induced criticality” that was first identified in m is shown to be robust to the 
inclusion of memory-times, heterogeneous user activity rates and complex network structures in 
the more realistic model used here. The current model requires more sophisticated mathematical 
analysis than that of Ref. [37] to deal with the long memory of users, but it enables us to 
understand how heavy-tailed distributions of meme popularity evolve over a range of timescales, 
as a few memes “go viral” but the majority become only moderately popular. 

We phrase the model in terms of meme propagation on a directed social network (like Twitter) 
and interpret a “meme” to be any distinct piece of information that is easily copied and trans¬ 
mitted (e.g., a hashtag or URL within a tweet). However, it should be clear that the model and 
its results can also be extended to the other examples of viral phenomena discussed above. For 
the adoption of apps on Facebook, for example, the memes are the notifications sent when a user 
installs an app m- If a friend is prompted by this notification to also install the app, then the 
meme propagates on the network and its popularity is measured by the number of installations of 
the app. We show that the crucial property of the model that poises the system at criticality is 
the competitive pressure for the limited resource of user attention, and this property is common 
to a broad range of social spreading phenomena that are characterized by the availability of large 
time-dependent data sets. 

The remainder of the paper is structured as follows. The model is introduced in Sec. [S] in 
Secs. El and m we derive and analyze a branching-process description of the model dynamics. 
We confirm the results of this analysis using numerical simulations in Sec. [5] and then use the 
analytical results to fit the model to hashtag popularities extracted from micro-blogging data in 
Sec. m and to explain novel features of the time-dependent data. In Sec. |7|we discuss limitations 
of the model and possible extensions of it. Note that the Secs. [3] and 01 may be omitted on a first 
reading without affecting the understanding of the model and the main results. 

2 Model 

In online communication platforms like Twitter, users follow (receive the broadcasts or “tweets” 
of) other users. In graph-theoretical terms, these relationships constitute directed links from 
the followed node (user) to the follower (Fig. 1). The network structure is defined by the joint 
probability pjk that a randomly-chosen node (user) has in-degree j (i.e., follows j other Twit¬ 
ter users) and out-degree k (i.e., has k followers), but the network is otherwise assumed to be 
maximally random (a configuration model directed network). The mean degree of the network is 
z = ^ kpjk = k^Pjk- If simplify the model by assuming that all users follow z others—as 
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we sometimes do to highlight the role of the out-degree distribution—then pjk can be replaced 
with dj^zPk, where 6j^z is the Kronecker delta and pk is the out-degree distribution. 

Each user has a “stream” that records all tweets received by the user, time-stamped by their 
arrival time. We assume that only a fraction A of the tweets received are deemed “interesting” 
by the user, and only the interesting tweets are considered for possible retweeting by that user. 
(Here we use the term “retweeting” in a general sense, to include any reuse of a previously-received 
meme such as a hashtag: note that a meme may be retweeted more than once by a user, unlike the 
model of Ref. [35] ). The activity rate of a user—the average number of tweets that she sends per 
unit time, i.e., the rate of the Poisson process that describes her tweeting activity—can depend 
on how well-connected the user is within the social network [3|, and we assume it depends on 
her in-degree j and out-degree k (her “(j, A:)-class” for short); this assumption is supported by 
empirical evidence from Twitter, see Fig. 6 of Ref. [38] . The user activity rates Pjk give the 
relative activity levels of users in the {j, k) class; the rates are normalized by choosing time units 
so that '^jkPjkPjk = 1- If there are N users in the network, this rate implies that an average of 
N tweets are sent in each model time unit. To simplify the analysis, we will sometimes specialize 
to the case where all user activity rates are equal: Pjk = 1. 

When a user decides, at time t, to send a tweet, she has two options (see Fig. 1); with 
probability p, the user innovates, i.e., invents a new meme, and tweets this new meme to all her 
followers. The new meme appears in the user’s own stream (it is automatically interesting to the 
originating user), and in the streams of all her followers (where it may be deemed interesting by 
each follower, independently, with probability A). If not innovating (with probability 1 — p), the 
user instead chooses a meme from her stream to retweet. The meme for retweeting is chosen by 
looking backwards in time an amount tm determined by a draw from the memory-time distribution 
and finding the first interesting meme in her stream that arrived prior to the time t — tm. 
The retweeted meme then appears in the streams of the user’s followers (time-stamped as time t), 
but because it is a retweet, it does not appear a second time in the stream of the tweeting user. 
The popularity n{a) of a meme is the total number of times it has been tweeted or retweeted 
by age a, i.e., by a time a after its first appearance (when it was tweeted as an innovation) [T9]. 
Figure [2] shows some examples of evolving meme popularities: each panel displays the popularity 
n(a) of a single meme as a function of its age a. 

The model as described is a “neutral model” [391140] in the sense that all memes have the 
same “fitness” m- no meme has an inherent advantage in terms of its attractiveness to users. 
Nevertheless, the competition between memes for the limited resource of user attention causes 
initial random fluctuations in popularities of memes to be amplified, and leads to the variability 
across memes seen in Fig. [2| and to popularity distributions with very heavy tails [17]: heavier, for 
example, than can be generated by models of preferential attachment or cumulative advantage 
type [22l[23l[25l|42l[43]. This “competition-induced criticality” was studied for a zero-memory 
= S{tm)) version of this model in Ref. [37]. Indeed, the results of Ref. [37] can be obtained 
as a special case of the model described here, by setting 4>(tm) = A = 1, Pjk = 1, and 

Pjk = SjzPk] numerical simulation results for a closely related model were first reported in Ref. [3]. 

A branching process approximation [Mlil] for the model enables us to understand how the 
network structure (via the out-degree distribution pk) and the users’ memory-time distribution 
(<I>(tm)) affect the popularity distribution of memes. Defining qn{a) as the probability that a 
meme has popularity (total number of (re)tweets) n at age a, the branching process provides 
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Figure 1: Schematic of the model. (A) Timeline of users’ actions in a typical realization of the 
model. User A is followed by users B and C; arrows between nodes denote the direction of 
information transmission. Note that user B also follows many other users, and so his stream 
contains more memes than the streams of A or C. At time t^, user A retweets a previously-seen 
meme (with probability 1 —/i, given A is active). She chooses the red meme to retweet, by looking 
backwards in her stream a distance determined by the memory-time distribution (only memes 
that A deemed “interesting” are shown in her stream). Her retweet of the red meme is accepted 
as “interesting” (and so inserted into their stream) by each follower of A with probability A. At 
time user C retweets the red meme to his followers, so further increasing the popularity of the 
red meme. At time tf user A innovates (a probability /r event, given A is active) by inventing 
the new blue meme and broadcasting it to her followers. (B) Branching process representation 
(Sec. [31) of the popularities of the red meme and of the blue meme. Each retweet generates new 
branches of the process, as the meme is inserted into the streams of followers of the tweeting user. 
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Figure 2: Examples of the age-dependence of meme popularity from numerical simulations of 
the model. Each panel shows the popularity of 10 different memes; the memes plotted are chosen 
at random from those whose popularity at age 100 is of order (A) 10^, (B) 10^, or (C) 10^. For 
model parameters, see the caption of Fig. |3]A. 


analytical expressions that determine the probability generating function (PGF) [451146] of the 
popularity distribution, 

OO 

H{a-, x) = ^ qn{a)x'^. (1) 

n=l 

The details of the derivation and analysis of the branching-process approximation are given in 
Sec. [3] and 01 The reader who is mainly interested in the applications of the model may jump 
straight to Sec. 01 while noting that the most important outcome of the analysis is that in the 
small-innovation limit /r —)• 0, the model describes a critical branching process, with power-law 
distributions of popularity (avalanche size) [471150] . 

3 Derivation of branching process approximation 

3.1 Derivation of governing equations 

We define Gjk{T,Q; x) as the probability generating function for the size of the “retweet tree”, as 
observed at time fl, that grows from the retweeting of a meme that entered, at time r < 11, the 
stream of a (j, /i;)-class user, see Fig. [3f3. To obtain an equation for Gjk, we consider the stream 
of a random (j, /c)-class user (called “user A”) with a meme M that entered the stream at time r 
(either by innovation, or because it was received from a followed user and deemed interesting by 
A), see Fig. 014. 

The likelihood that meme M is retweeted in the future depends on how quickly other tweets 
enter the stream of user A. In fact, meme M can be considered to “occupy” the stream for a 
time interval ^ stretching from r until the time t + (. when the next interesting meme enters the 
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Figure 3: Schematic for the derivation of the PGF equations, see Sec. [3l (A) The stream of user 
A, showing only memes that were deemed interesting by user A; each color represents a different 
meme. At time t, user A decides to retweet a meme from the past, and looks back to time r, 
where she finds meme M (colored red). She sends this meme to her followers (not shown); each 
follower independently deems the meme interesting with probability A. Also shown is a later 
retweet event, which also copies meme M. (B) The retweet tree for meme M, seeded at time r. 
Each retweet by user A of meme M generates a new branch on this tree; each branch can also 
generate further retweets by followers of A, these subtrees are denoted by squares. (C) Schematic 
depiction of Eqs. d?]) and (fTHll . 
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stream of user A. New memes enter the stream as a Poisson process at the constant rat^il 


rjk = j/3A + ^il3jk, (2) 

so the occupation time i of meme M—the time it occupies the stream of user A —is an exponen¬ 
tially distributed random variable with density 


-Pocc(^) = 'Tjfcexp {-rjki). 


(3) 


We note in passing that the mean occupation time 


{£) = r £ P^^^{£) d£ = 

Jo j/3X + 




(4) 


is, for small innovation probabilities /r, inversely proportional to j, the number of users followed. 
Thus, a user who follows many others experiences tweets entering his stream at a higher rate than 
a lower-j user (compare the streams of users B and C in the schematic Fig. 1). Consequently, the 
high-j user is less likely to see (and so to retweet) a given meme than a low-j user. This aspect 
of the model clearly reflects empirical data, as seen in Fig. 3 of [51] for example. 

To determine the size of trees originating from meme M, we consider that trees observed at 
a time n > r must be created by the retweeting by user A, at some time(s) between r and fl, 
via looking back in her stream to a time r, where r lies between r and min(r -|- £, fl) (i.e., r lies 
within the time interval where meme M occupies the stream). Let’s consider a time interval of 
(small) length dr, centered at time r, and calculate the size of trees that are seeded by a retweet 
based on a lookback into this interval, from a time t, with t > r, see Fig. [3l In each dt interval 
centered at time t, a tree will be seeded with probabilit}{l 


^seed = (1 - £L)/3jk*^{t - r) dr dt, (5) 

and will grow to a tree with size distribution (at observation time 12) generated bjH 

Rk{t, 12; x) = X [1 — A -|- XG{t, 12; x)]^ , (6) 

where 

G{t,Q;x) = '^^PjkGjkit,Q-,x) (7) 

j,k 

is the PGF for the sizes of trees originating from the successful insertion at time 2 of a meme 
(that is deemed interesting) into the stream of a random follower. 

^User A follows j users, each of which is assumed to tweet at the average rate /3 = ^PjkPjk- Each meme 
sent by these j users is deemed interesting by A with probability A, so the rate at which interesting memes enter 
the stream of user A is jPX. Moreover, user A innovates at a rate pPjk, which gives the second term of Eq. ([2|). If 
either an incoming tweet or an innovation event occurs, a new meme is inserted into the stream of user A, and the 
occupation time of meme M is ended. 

^The factor (1 — p)l3jk dt is the probability that a {j, fc)-class user becomes active in the dt interval and copies 
rather than innovates; the factor <I>(t — r) dr is the probability that this user chooses to copy from the dr-interval. 

^There are k followers of user A, each of whom may deem the tweet “uninteresting” with probability 1 — A, or 
consider it “interesting”—and accept it into their stream—with probability A. The factor of x counts the increase 
in popularity due to the tweet event. 




To calculate the total size of the tree seeded by copying from the dr-interval, we must add the 
sizes of trees that are copied into all times t with t > r. Since each copying event is independent, 
the total tree size is generated by 

n 

J{r; x) = [1 - Pseed + PseedRkit, a?)] . (8) 

t=r 


Taking logarithms of both sides of this equation and expanding to first order in dt gives 


log J 


= X] [1 - (1 - - r) dr dt{l - Rk(t, Q] x))] 


Ri -(1 - - r)drdt{l - Rk(t,^l-,x)) 

t=r 

I'Q, 

—{1 — fi)f3jk dr / — r){l — Rk{t,^l] x)) dt as dt ^ 0, 

Jr 


so J{r;x) can be written as 

J(r; x) = exp 


-{1 - n)/3jkdr / d>(t - r)(l - Pfc(t,n;x)) dt 


(9) 


( 10 ) 


Recall that J(r; x) is the PGF for trees seeded by copying from time r. To obtain the total size of 
all children trees of meme M, we must consider trees seeded at all possible times r from r to the 
time min(r + i, fl) that marks the end of the occupation of user ^’s stream by meme M. Each 
dr time interval again independently generates trees with sizes distributed according to Eq. m, 
so the PGF for the total size is found by multiplying together copies of the J{r;x) function for 
each dr time interval, thus: 


min(r+^,r2) 

Psize(^) = J{r;x) 


= exp 


exp 


min(r-|-£,0) q 

-{l-fi)j3jk ^ dr ^{t-r){l-Rk{t,n;x))dt 

r=T 

fmin(T-|-£,f2) rfl 

dr / dt— r)(l — ri;x)) 


r 

-{l-fi)j3jk J 


as dr —)• 0. (11) 


Gombining probabilities, by integrating over all possible occupation times i, gives 


Gjk{r,n-,x)= PoMPsiMM 

Jo 


( 12 ) 
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and combining Eqs. m, © and (fTTI) yields an integral equation for G: 


G{t, n;x) = 'y] -Pjk [ d£ {jj3X + nPjk) exp [- (j/3A + n/3jk) i] 

^ ^ JO 

-il-n)(3jk [ 

Jo 


jk 
X exp 


min(r+£,f!) pQ 

dr J dt^{t — r){l — x[l — X +XG{t,Q]x)]^) 


(13) 


Introducing the change of variables a = XI — t, f = r — t, f = Q — t, we rewrite this equation as 


G{n - a,n-x) = '^ ^pjk j di (j/3A + p/3jk) exp [- (j/3A + p^jk) i] x 

r*min(€,a) pa—f 


jk ^ -^0 


X exp 


pmm[t,a) pa—r 

-{1-H)f3jk / df df$(. 

Jo Jo 


a — f — f)(l — x[l — A + XG{Q — f, H; x)] ) 

(14) 


Note that the only appearance of the observation time XI in this equation is in the first two 
arguments of the G function: this reflects the fact that the popularity of memes in this model 
depends only on their age a (unlike cumulative-advantage models, which exhibit a dependence also 
on the global time because early-born items have an “early-mover” advantage [52]). We therefore 
compress the notation by defining G in terms only of the age a of the memes: G{X} — r; x) := 
G{t,XI]x), and G{a;x) solves the integral equation 
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G(a; x) = 'y] -Pjk / di [jjdX + pidjk) exp [- (j/3A pjdjk) i] 

/*min(£,a) p 

-{l-p)l3jk / df 

Jo Jo 


jk 
X exp 


min(£,a) pa—f 

df I df^{a —f — f){l — x[l —X +XG{f]x)]^) 


, (15) 


with initial condition G(0;x) = 1. 

The popularity of a meme, as observed at time 11, that is seeded by a single tweet (e.g., by 
an innovation) at time r may be calculated in a similar way to the derivation of Eq. (|15l) : the 
generating function is of the form 


H{t,XI;x) = '^(3jkPjkRk{T,Xl-x)Gjk{T,Xl-x), (16) 

j,k 

where fJjkPjk represents the probability that the seed tweet originates from a {j, A:)-class user, Rk 
is the PGF for the trees generated from the followers of the user, and Gjk is the PGF for the size 
of the retweet-tree of the meme (see Fig. (Sp). Introducing the age a of the meme as before and 
defining qnia) as the probability that an age-a meme has popularity n, we have the PGF defined 
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in Eq. ([T|), which is given by 


POO _ _ 

H{a-, x) = W PjkPjkX [1 - A + AG(a; x)f / di [jfdX + pPjk) exp [- {jf3\ + p/djk) i] 

jk 


X 


X exp 


-(1 -/i)/3jfc [ 

Jo 


min(£,a.) pa—r 

dr / df ^{a — f — f){l — x{l — \ +\G{f;x)]^) 

Jo 


; ( 17 ) 


the initial condition is H{0;x) = x (i.e., all memes have initial popularity 1: 9ri(0) = 5^,1 


3.2 Distribution of response times 

It is worth noting that all agents in the model have constant activity rates, so that the actions 
of each individual agent constitute a Poisson process. A Poisson process is characterized by 
an exponential distribution of inter-event times, where each event corresponds to an innovation 
or a retweeting action. This assumption is contrary to studies such as [30 1 [3311^ [53115^ . 

where heavy-tailed distributions of inter-event times are examined. Despite this, in our model 
the memory-time distribution ^{tm) directly influences the waiting times (or “response times”) 
between the receipt of a specific meme, and the retweeting of it. Indeed, if $(tm) is a heavy-tailed 
distribution, then a meme received by a given user at time r will be retweeted by that user at a 
time t (with t ^ t) with probability proportional to <I>(t — r) (the exact relation depends on how 
long the meme occupies the stream of the user). Therefore, a heavy-tailed memory distribution 
gives rise to a heavy-tailed waiting-time distribution for individual memes, despite the fact that the 
activity of each individual user is described by a Poisson process (cf. the heavy-tailed waiting-time 
distributions found in empirical studies of email correspondence |291l3nj). It is clearly important 
to distinguish between the distributions of inter-event times (for actions of users) and of the 
waiting times experienced by individual memes: the model assumes each user has exponentially- 
distributed inter-event times, but it can nevertheless produce heavy-tailed distributions of waiting 
times for memes to be retweeted. 

In particular, if the memory-time distribution <h(tm) is a Gamma(fcG')^) distribution [35] as 
used in Secs. [5] and El i.e., = r(fcG^)6>*^G i~'^rn/d), then <h(tm) is approximately 

power-law for memory times with tm ^ d, with an exponential cutoff at larger times. The 
corresponding waiting-time distribution shows a similar scaling in this range, like the slow decay 
noted in empirical response times for Twitter users (e.g., in Fig. 5 of |5T]). In Sec. [8]we consider 
how the model could be extended to incorporate bursty (non-Poisson) user activity. 


4 Analysis 

4.1 Criticality of the branching process 

A branching process may be classified by the expected (mean) number ^ of “children” of each 
“parent”: if this number (called the “branching number”) is less than I, the process is subcritical 
and if ^ is greater than I the process is supercritical. Critical branching processes, with an average 
of exactly one child per parent, give rise to power-law distributions of tree-sizes and of durations 
of growth cascades, and have been used to examine self-organized criticality in sandpile models 
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on networks [471150] . Here we demonstrate that the general process derived in Sec. [3] is a critical 
branching process in the limit of vanishing innovation /r —^ 0. 

We identify the “parent” in the process as a meme that was accepted into the stream (i.e., 
deemed interesting) of a (j, /i:)-class user at time r: see, for example, meme M in the stream of 
user A, as shown in Fig. [3l The “children” of this meme are the retweets of it that are accepted 
into the streams of the followers of A at any time t > t. The PGF for the number of children of 
meme M is derived by following the same steps as in Sec. El but replacing by (1 — A + Xx)^: 
each power of x then counts a successful insertion of meme M into the stream of one of the k 
followers of A. The resulting PGF, for a meme of age a, is (cf. Eq. (ll.Sp i 


1*00 
Jo 

f 

-(1 - n)/3jk 

Jo 


X exp 


min(£,a) ra—f 

dr 


f df <h(a — f — f)(l — [1 — A + Ax]^) 

Jo 


f 


d^Pocc(^)exp 


-(1 - fi)l3jk{l - [1 - A + Xx]^) 


r*min(£,a) 


C{a — f)df 


, ( 18 ) 


where C{t) = ^(tm)dtm is the cumulative distribution function (GDF) for memory times. The 
expected (mean) number of children for a meme in the (j, A:)-class stream is determined from the 
PGF in the usual way [l5], by differentiating with respect to x and evaluating at x = 1, thus: 


^jk 


dKjk 

dx 


X = 1 


(19) 


In the limit of large ages, o oo, we use the fact that C{oo) = 1 to obtain 


~ (1 - d)/liA:AA: / £Pocc{J)d£ as ooo 
Jo 

^ (1 - lx)/3jkXk 

j/3X + ^(3jk 


( 20 ) 


Averaging over all (j, k) classes, the effective branching number ^ of the process is the expected 
number of children of a meme that is accepted into the stream of a random follower: 


i ^ Pjkijk 

j,k 

j 


E j fJjkXk 


as /i —)■ 0 


( 21 ) 


(recall that jJ = jlJjkPjk)- 

Thus, we have shown that the branching process underlying the model is critical when ^ = 0. 
The occupation time of a meme in a users’ stream is due to the competition between neutral¬ 
fitness memes for the limited resource of user attention; this competition ensures that the mean 
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number of successful retweets (children) generated during the finite occupation time of the meme 
is precisely one, and so induces the power-law distributions of cascade sizes that are characteristic 
of critical branching processes [47ll50j . 

It is worth noting that the result of Eq. ()21j) can also be derived in a more heuristic fashion, 
which enables us to discuss possible generalizations of the model in Sec. [71 As above, we want to 
calculate ^jk, the expected number of children of a parent meme M that has been accepted into 
the stream of a (j, A:)-class user, called user A. We consider a (long) time window of duration 
W units. During this time window, a total of approximately {jfiX + ^l3jk)W tweets have been 
accepted into the stream of user A (see footnote [H and Eq. ([1]) ). When user A decides to retweet 
during the time window, one of these memes is chosen for retweeting. If the times chosen by 
the user are uniformly distributed over the window then the probability that the chosen meme is 
meme M is 


^chosen i 

number ot memes m stream 


1 


( 22 ) 


Alternatively, this result can be calculated by noting that the average time that a single meme 
occupies the stream is given by {€} in Eq. (|3|), so the expected fraction of the total time that 
meme M occupies the stream of user A over the window of length W is {€) jW = Echosen- 

Recalling that the activity rate of user A is /3jfc, the expected number of retweets by this user 
during the time window is 

A^retweets = (1 “ IJ')l3jkW. (23) 


Each retweet is broadcast to the k followers of A, each of whom finds the retweet interesting with 
probability A, so the expected number of children (memes deemed interesting by followers) per 
retweet is Xk. The expected number of children of the parent meme M over the time window is 
therefore 


^jk = (number of retweets by A) x (probability meme M is chosen) x (children per retweet) 

— A^retweetsEchosen AA;, (24) 

which recovers Eq. ()2n|) . The expected number ^ of children of a meme that is accepted into the 
stream of a random follower is then calculated as in Eq. (EH), giving 1 in the /i —)• 0 limit. 


4.2 An explicit expression for qi{a) 

The value qi{a) is the probability that a meme, once created via an innovation event, is not 
retweeted by the time it reaches age o: recall that the popularity n of a meme is set to 1 when it 
is first tweeted (i.e., at birth); subsequent retweets (if any) increase the value of n above 1. The 
probability qi{a) may be calculated explicitly using Eq. CZD: 


gi(a) = lim 


H{a] x) 


x^O X 


l3jkPjk [1 - A AG(a; 0)]^ / M Pocc(^) exp 


/*min(€,a) 

-(l-/x)/3jfc / C{a-r)d. 

Jo 


— r)dr 


(25) 
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with G(a; 0) given, from Eq. (fT^ . by 

) 

(i£Pocc(^)exp 


G{a; 0) = ^ ^Pjk 

j,^ 


pm'm{i,a) 

-{l-fi)f3jk C{a-f)dr 

Jo 


(26) 


If we consider the large-age limit, a —>■ oo, than we can approximate the integral of the cumulative 
distribution function for memory times as 


I 


min(£,a) 


C{a — f)df Tn iC(a) 

and the integral over i can be calculated to give the large-a approximation 

+ pfJjk 


(?i(o) ~ ^I n \R nt \ 
^ J/3A -h pPjk + (1 - p)/3jkC{a) 


[l-A + AG(a;0)]' 


with 


r^/ n^ j d'Pjk 

G{a:0) ~ > -pik^ - - -■ 

^ 2 j/3A -h pPjk + (1 - p)/3jkC{a) 


In the simplified case pjk = dj^zPk and fJjk = 1, Eqs. 

Xz + p 


qiia) 


Xz -\- p — p)C{a) 


k=0 


1 - A + A 


and ([29]) reduce to 
Xz + p 


Xz + p + {I - p)C{a) 


(27) 


(28) 


(29) 


(30) 


The a = oo limit of gi(a) gives the fraction of memes that are never retweeted, and so have 
popularity n = 1 forever. The value of qi{oo) is obtained from Eqs. (f^ and ([2^ by setting C{a) 
to its a —)> oo limit of 1. The approach of qi{a) towards the value qi{oo) depends, through the 
CDE C'(a), on the tail of the memory-time distribution <I>. If the distribution <1> is heavy-tailed, 
there is a non-negligible probability that a meme may be retweeted even if a very long time has 
elapsed since its birth. 

4.3 Mean popularity 

The age dependence of the mean popularity (i.e., the expected number of tweets/retweets for a 
meme of age a) is given by 


m{a) = y~]ngn(. 


a = 


dH{a\x) 


n=l 


dx 


(31) 


X=1 


m[a) = 


Differentiating (I17p and setting x = 1 yields an integral equation for m{a): 

Y] /JjkPjk <1 + XkmG{a) + {l- p)j3jk [ di {j^X + p/Jjk) exp [- {j/3X + p/3jk) 
jk -^0 

rmm{£,a) ra—r | 

J J df ^{a — f — f)[l + XkmG{f)]\ , 


(32) 
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where mda), defined by mda) 
differentiating Eq. m- 


dG{a\x) 

dx 


x=l 


is the solution of the integral equation found by 


7 — — 

'mcia) = Y] -Pjk / di (j/3A + fi/3jk) exp [- (j^A + fi/3jk) i] x 

jk ^ -^0 

rmm{i,a) pa—f 

X (1 — / dr df — f — f) [1 + XkmG{f)] . (33) 

Jo Jo 

The order of the time integrals may be swapped using the identity 

^oo pmm(i,a) 


di, 


poo pmin[t,a) pa pc 

/ di dr = df 

Jo Jo Jo Jf 

and the resulting i integral can be performed explicitly: 

poo _ _ 

Jr 


(34) 


(35) 


As a result, the expressions (l32|) and (l33]l can be written as double convolution integrals. Taking 
Laplace transforms, Eq. (|32p then becomes 


(36) 


m(s) = - + z^Xmcis) + (1 - /x)$(s) ^jkPjk ’ 

s Tk 3pX + ^l|3jk + s 

where hats denote Laplace transforms, e.g., 

poo 

l>(s) = / e~''^^{t)dt, 

Jo 

and with rhcis) given explicitly from the Laplace transform of Eq. (1331) : 


(37) 


rhcis) = 




1 - (1 - /i)A$(s) Y.j,k iPjk 




j/3A+^/3jfc+s_ 


(38) 


If we specialize now to the simplified case where (djk = 1 for all (j, k) classes, and pjk = dj,zPk-, we 
obtain the simpler expression 


mG{s) 


(1 - ii)H!>)Tdpr, 




Substituting for ifiG into the simplified version of Eq. ([36]) yields 

„ 1 1 - M {Xz + l)$(s) 

m(s) = - H-^^-. 

s s Xz^s — {1 — p)Xz^{s) 


(39) 


(40) 
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Note that, unlike the expression for qi in Eq. (1301) . the mean popularity depends on the out-degree 
distribution pk only through the mean degree 2 :, implying that the mean popularity is independent 
of the finer details of the network structure. 

To consider the large-age asymptotics of m{a) from Eq. (jdOjl we use results from renewal 
theory |35ll60j . If the Malthusian parameter a exists, where a is defined as the solution of the 
equation 

{I- p)Xz^{a) ^ ^ 

Xz jjj oi 

then the large-age, small-/i asymptotic behavior of m{a) can be written as (Theorem IV.4.2 of |60] 1 


1 1 _tiXl+lla 

m{a) ~-e i+t>.z as a ^ 00, n —)• 0 . 

p p 


(42) 


Here we have nsed the fact that near criticality (i.e., as p —)• 0) the Malthusian parameter a. is 
determined by Eq. ()4T]) to be a = — where T = dtm is the mean 

memory tim^. Setting a = 00 in Eq. (f42]l . we obtain the steady-state value of the mean popularity, 
m(oo) = I//i. Although Eq. (j42p is a large-a asymptotic result, we may expand the exponential 
term about a = 0 provided that the argument of the exponential remains small: this is valid for 
ages a that obey the constraint a <C ■ Taking the /r —)• 0 limit of Eq. (I42|) shows that the 

function m(a) grows linearly with a for ages in this range; 


m{a) ~ 


Xz + 1 
1 + TXz 


(43) 


The preceding analysis all assnmes that the seed node (i.e., the user who hrst tweets the meme 
of interest) is chosen at random from all the network users, with probability weighted by the user 
activity rate. It is straightforward to repeat the steps of the calculations for the case where the 
seed node is known to have k followers, and so to investigate the importance of the connectivity 
of the seed node. Restricting onr attention to the simplified case as above, and taking the inhnite- 
age limit, we find that the expected popularity for a meme that is initiated by a seed node of 
out-degree k is 


mfc(oo) 


+ ^ (I ~ k-) A _ 

Xz + p{Xz + 1 ) J 


(44) 


Note the linear dependence of this expression on the number of followers k of the seed node: 
memes tweeted by users with a large number of followers are likely to become more popular than 
memes seeded by less influential nodes. This feature of the model matches well to the observed 
dependence of the size of information cascades on the connectivity of the initial seed (e.g.. Fig. 2 
of [62]). Of course, the earlier results for randomly-chosen seeds are recovered by averaging over 
all possible seed nodes: m{oo) = ’^^Pk'nikipo) = IjpL. 


"^Note that the Malthusian parameter exists for all the memory-time distributions considered in this paper 
(exponential and gamma distributions). However, if is a subexponential distribution [60] (such as the lognormal 
distribution IM]), then the large-a asymptotics of the mean popularity are related to the memory time CDF by 


instead of Eq. (jig. 


m(a) 


1 


{1 - + ^^) 
/i2(Az-bl) 


(l-C(a)) 
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4.4 Infinite-age limit of popularity distribution 

In the infinite-age (steady-state) limit a —oo, we assume G{a;x) — Goo{x), independent of a, 
and use the fact that <h(t) dt = 1 in Eq. (1151) to obtain 


Goo{x) = Y] -Pjk / d£ {j/3X + fJ./3jk) exp [- (j/3A -h n(3jk) i] x 
jk ^ do 


X exp 


-(1 - n)^jki{l - X [1 - A -b XGoo{x)] ) 
Calculating the i integral then gives the equation satisfied by Goo(a^): 

j „ -b fi/Sjk 


°° ^ ^ ^ + /djk - (1 - ^j)l3jkX [1 - A -b XGooixY 

Similarly, the infinite-age limit for H is given in terms of Goo by 

[jl3X + nl3jk) X [1 - A -b AGoo(x)]^ 


Hoq{x^ — ^ ^ fdjkPjk .■ 


jk 


jjdX + 13jk - (1 - iJ-)l3jkX [1 - A -b AGoo(x)] 


(45) 


(46) 


(47) 


Note that these steady-state equations are independent of the memory distribution function <I>. 
Accordingly, the asymptotic analysis approach used in m to obtain the large-n behavior of the 
popularity distribution qn{oo) may also be applied here: this is based on writing x = 1 — w 
and Goo = 1 ~ 4'i'ix) analyzing the small-re, small-0 asymptotics of Eqs. (f46l) and (j47)) . We 
refer to m for details, and here summarize the main results for the simplified case f3jk = 1, 
Pjk — dj^zPk- 


• Case 1: pk has finite second moment 

The large-n scaling of the popularity distribution is given by a power-law with exponential 
cutoff: 


Q'n(oo) r\j An 2 e 


n 

K 


as n —)• oo. 


(48) 


where the prefactor A i 




^ ^ z{Xz + l) 
Xz + p 


2-k 



(2 -b Xz — p) 
Xz + p 



1 

2 


(49) 


and the cutoff k is 

(A:^) (2 + Xz - p) 

Xz + p 

Note that k is proportional to 1/p’^ for small p, so in the limit of vanishing innovation 
probability the exponential cutoff tends to infinity and the power-law part of the popularity 
distribution extends to all n. 

®The values of A, k and B reported here are not identical to those reported in |37| : this is because of an 
approximation made in the analysis of mi that is not required here (see Eq. (S6) of [3Z1)- However, the differences 
are of order 1/(A«), and so are negligible in the case Xz ^ 1 that is considered in |37| . 



2A2(1-//)2 
/i2(Az + l)2 
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• Case 2: pk ^ D k~'^ as /c —)• oo, with 7 between 2 and 3 

Immediately taking the /x —^ 0 limit, we find in this case that the popularity distribution 
has a power-law form with exponent 7/(7 — 1) lying between 3/2 and 2 |47ll48j : 


Q'n(oo) B n T"-! as n —)• 00 


(51) 


with prefactor B given by 



(52) 


where T is the gamma function. 

4.5 Large-a, large-n asymptotics of popularity distribution 

In Appendix 1X1 we consider how the popularity distribution qn{a) behaves for large, but finite, 
ages, focussing on the case Pjk = 1, Pjk = ^j,zPk for simplicity. The result of the asymptotic 
analysis is an expression for the Laplace transform of the PGF H(a] x) that is valid in the a —>■ 00 
limit, see Eqs. (jA.lip and (IA.12P for the cases of out-degree distributions pk that have second 
moments that are, respectively, infinite or finite. 

5 Results: numerical simulation 

To confirm the accuracy of the branching-process approximation and to explore the interactions of 
the network structure and the memory-time distribution, we here compare numerical simulations 
of the model with the theoretical predictions of Sec. [H We generate configuration-model directed 
networks with prescribed out-degree distribution pk- Each one of N users (nodes) is assigned a 
random number k (drawn from the distribution p^) of out-links (links to followers). The identities 
of the k followers are chosen uniformly at random from the set of all users; in the N ^ 00 limit, 
this gives a Poisson in-degree distribution pj which, for sufficiently large 2 :, gives similar results to 
using the in-degree distribution pj = 6j^z, i-e., assuming every user follows exactly 2 others [37] . 
Each user has the same activity rate, so I3jk = 1. 

Figure |4l4 shows the fraction of memes that have popularity greater than or equal to n, at 
age a. Black symbols are the results of numerical simulations; the colored curves are determined 
from the large-a, large-n, p = 0 asymptotics of Eq. (jA.lip . using the Laplace transform inversion 
described in Appendix [B] The main figure in panel Eig. HjA shows results for networks with the 
scale-free out-degree distribution p^ ~ Dk~^ for k > A and exponent 7 = 2.5 (with Pk = 0 
for /c < 4); the inset shows the results for networks with a Poisson out-degree distribution with 
mean degree z = 11 matching that of the scale-free networks. The memory time distribution is 
= Gamma(A;G,0) with ka = 0.1, 0 = 50 for the scale-free case and kc = 0.1, 9 = 5 for the 
Poisson case; the mean memory time for this distribution is T = kcO. 

Panels B and C of Fig. |4] show results for various memory time distributions ^ on networks 
with the same scale-free out-degree distribution as used in panel A, and panels D and E show 
the corresponding results for the Poisson network. Panels B and D show the fraction qi{a) of 
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Figure 4: Numerical simulations of the model, compared with analytical results. (A) Comple¬ 
mentary cumulative distribution functions (CCDFs) for meme popularity at age a: numerical 
simulation results (black) on a network with scale-free out-degree distribution oc for 
k > 4: with 7 = 2.5, mean degree = 11, N = 10^ nodes), compared with asymptotic model 
resnlt Eq. (lA.llI) (colored curves). The memory-time distribution is $ = Gamma(A:G', 0) with 
kc = 0.1 and 6 = 50, so the mean memory time is T = kc d = 5. Inset: As main, but for Poisson 
out-degree distribution pk = H) and gamma memory-time distribution with kc = 0.1 and 
6 = 0.5. (B) Fraction qi{a) of memes that are not retweeted by age a, on the scale-free network of 
(A) and for various memory-time distributions ^{tm) (red = exponential with mean T, blue/green 
= Gamma(0.1, lOT)), using Eq. (f30|) . Dashed lines show the T = 5 cases; solid lines represent 
T = 1. (C) Mean popularity of memes of age a, for the same cases as in (B), and compared with 
Eq. (j40l) (using the numerical Laplace transform inversion described in Appendix [B]); inset shows 
the large-a behavior. All panels have p = 0.02 and (except for green curves) A = 1. (D), (E): As 
panels B and C , but for a network with Poisson out-degree distribution (mean degree z = 11), 
with /i = 0.02 and (except for green cnrves) A = 1. 
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memes that have not been retweeted by age o, along with the large-a asymptotics of Eq. (1301) . 
The age-dependence of <?i(a) is qualitatively similar in panels B and D: note in both panels that 
the cases with longer mean memory time T = 5 (dashed curves) approach their a —)■ oo limit 
more slowly than the T = 1 cases (solid curves). However, the limiting value of qi{a) as o —)• oo is 
different in the two panels, reflecting the effect of the network structure (out-degree distribution). 
Using Eq. (|3n]i (with C{oo) = 1) we obtain gi(oo) = 0.50 for the scale-free network with A = 1, 
whereas (?i(oo) = 0.37 for the Poisson network. 

The mean popularity m{a) of age-o memes is shown in panels C and E for the scale-free and 
Poisson networks, respectively, and for the same memory-time distributions as used in panels B 
and D. In contrast to the results for ( 71 (a), we see that the hirer details of the network structure 
have no effect on the m(o) curves: panels C and E are identical, because Eq. (j40]l depends on pk 
only through the mean degree z, which is identical for both networks. The mean memory time T 
determines the rate of linear growth of m{a) at intermediate ages (see Eq. (j43[) ). while at early 
ages, the gamma memory time distribution <h(tm) (which has signihcant probability mass at low 
values of tm) gives a faster-than-linear growth of m(a) that is not present for the exponentially- 
distributed memory times. The large-age asymptotics are shown in the insets; as discussed in 
Sec. 14.31 we hnd m{a) —)• 1/// as a —)• 00 . As we show in Sec. [ 6 ] below, the m(a) curves can 
be htted to empirical data on the popularity of Twitter hashtags; note also that the qualitative 
features identified here (nonlinear early growth; linear intermediate-time growth, saturation at 
later times) have also been observed in several other measures of information spread on social 
networks, such as views of YouTube videos [63] and the installation of Facebook apps |5| . 

6 Results: Twitter hashtags data 

6.1 Data and model inputs 

To test the ability of the model to fit real-world data, we use a 1-year dataset comprised of the 
popularities of 1.4 x 10^ hashtags related to the 2011 15M protest movement in Spain that were 
tracked over the 1-year period from March 2011 to March 2012 [641165] . We use all hashtags for 
which we have at least 200 days of data; each curve in Fig. [ 6 ]A shows the popularity distribution 
for all hashtags which have the same age (to the nearest day). 

The out-degree distribution of the Twitter network is an important input to the model. We 
determine the empirical distribution by randomly selecting 8.2 x 10^ Twitter user ids and recording 
the number of followers k of each user. The measured mean number of followers is z = 703, but 
the distribution pfc is heavy-tailed. The complementary cumulative distribution function (CCDF) 
of the k values is shown in Fig. (5] along with the line D/{'y — l)k^~^ with D = 240 and 7 = 2.13 
that corresponds to an out-degree distribution with tail scaling as pk ^ D k~'^ as /c —)• 00 [ 66 ] . 

The model parameter A and the memory-time distribution <h(tm) cannot be directly estimated 
from the data because in cases where users receive multiple copies of the same meme (hashtag) 
prior to retweeting it, it is impossible to tell which of received memes “caused” the retweet. 
Therefore, we instead use the analytical results of the model (Eqs. (jiH]) and ([A.llM to hnd 
parameter values that ht the model to the statistical characteristics of the data. Guided by the 
faster-than-linear growth of the mean popularity at early ages a (Fig. [ 6 ]C) and the results of 
Sec. (5] we assume that the memory time distribution <I> is a Gamma(fcG,0) distribution, and ht 
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Figure 5: CCDF for the number of followers fc of a random sample of 8.2 x 10^ Twitter users. The 
straight line corresponds to an out-degree distribution with tail scaling as pk ^ D k~'^ as A; —)• oo, 
with D = 240 and 7 = 2.13 (xmin = 1-1 x 10“^, fitted as described in [ 66 ]). 


the distribution parameters kc and 6, as well as the model parameters p and A to give the results 
in Fig. [ 6 p. Note that a delta-function memory-time distribution, as used in the toy model of [37] . 
leads to a purely-linear dependence m{a) oc a, and so cannot fit to the early-time growth of the 
observed mean popularity. 

The data does, however, provide an upper bound on the value of the innovation probability p. 
Recall that p is defined as the probability that a tweeted meme (hashtag) is an innovation, i.e., 
that the hashtag has never before appeared in the system. Each innovation event thus increases 
by one the number of distinct hashtags that appear in the dataset, whereas a non-innovative 
(copying) tweet will instead increase the number of copies of a hashtag that is already present in 
the dataset. We can therefore calculate an upper bound on the empirical innovation probability 
from the ratio 


number of distinct hashtags used in the dataset 
total number of hashtags tweeted by users 


322799 

5886837 


0.055. 


(53) 


Note this upper bound is consistent with the parameter value oi p = 0.033 that is fitted in Fig. [UJ 
The reason why Eq. (I53p gives an upper bound rather than an exact value for p is the finite size 
of the dataset: the data collection started at a specihc point in time and so any hashtags that are 
in fact copied from tweets received prior to the start date will be erroneously counted as “distinct 
hashtags” in the estimate, thus leading to an overestimate of the true innovation probability. 


6.2 Results using identical user activity rates 

Using the empirical Twitter out-degree distribution we apply the analytical results of Eqs. (HOll 
and (jA.lip (which assume f3jk = 1) to fit the model to the data in Eig. [6l Eigure [6]A and EB 
show that the model-predicted age-dependent popularity distributions match reasonably well to 
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Figure 6: Comparison of the model with Twitter hashtags data. (A) CCDFs for popularity of 
hashtags at age a (at time a after their first appearance in the dataset). The model CCDFs (from 
Eq. lA.lip are multiplied by 10 for clarity. Model parameters are: A = 4.5 x 10“^, yt = 0.033, 
kc = 0.25, 9 = 500, with one model time unit corresponding to 0.16 days (B) CCDFs at age 
a, each divided by the mean popularity at age a. The data shows an apparent collapse onto a 
single curve that is closely matched by the model. (C) The mean popularity of hashtags of age a. 
(D) The fraction qi{a) of hashtags that are not retweeted by age a. Here, our basic model with 
homogeneous user activity rates does not fit well to the data (but see Fig. [7]). 


the data, and Fig. [6lC shows that the age-dependent mean can be fitted very closely by the model. 
The data collapse seen in Fig. [HB is intriguing, and we analyze it further in Sec. 16.31 below. 

Despite these successes, it was not possible to successfully fit the gi(a) curve (Fig.[6p) using the 
simplified version of the model in which all users have the same activity rates. In Sec. 16.41 below, 
we therefore investigate the effect of heterogeneous activity rates, and show that an improved fit 
can be obtained using more realistic rates. 

6.3 Analysis of the data collapse in Fig. [6]B 

As shown in Fig. [6l3, the ratio qn{a)/m{a) is approximately independent of the age a, giving a 
collapse of the popularity distribution data (and of the model predictions) onto a single curve. 
As in Sec. 031 the large-n asymptotics of the popularity distribution are found from the small-u) 
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expansion (with it; = 1 — x) of h{a; x) = 1 — H{a-, x), and for the scale-free out-degree distribution 
we obtain from Eq. (lA.lOp (using the final value theorem for Laplace transforms) the following 
asymptotic behavior in the a —)■ oo limit: 


/i(oo; 1 


w) ~ {Xz -|- l)^ 


1 1 
7-1 UJ 7-1 


as u; —>■ 0 . 


(54) 


Understanding the large-a approach to this steady state (i.e., the case where a is large but finite) 
is a difficult problem in asymptotic analysis, involving the double limits n —)• oo and a —)• oo. 
However, some insight can be obtained by factoring the function h into a product of its infinite- 
age limit /i(oo; x) and another function hi, with hi limiting to 1 as a —)• oo: 


h{a;x) = h{oo; x)hi{a-, x). 


(55) 


Taking Laplace transforms gives 


h{s]x) = h{oo-, x)hi{s; x), 


(56) 


where 


Xz{s + Xz + 4*(s)) (7 — 1 )AL) 7 -i [r(l — 7)]7-i t(; 7 -i$(s) 

s{Xz + l)(s + Xz) Xz^{s) -h (7 - l)XD^ [r(l - 7)]^ t(;^4>(s) 
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In particular, note that /ii(s;l — w) depends on w only through the factor rcr-i. In the case 
where 7 is very close to 2 , the exponent (7 — 2)/(7 — 1 ) of the w dependence is close to zero, 
and the dependence of hi on w is therefore very weak. It follows that the rate of approach of the 
corresponding distribution qn{a) to the steady state qn{oo) does not show a strong dependence 
on n, and the CCDFs for various ages appear almost parallel in the log-log plot of Fig. [6l4 (note 
7 = 2.13 in the Twitter network). 

As we saw in Sec. 14.31 for the large-age asymptotics of the mean popularity, the long-time 
behavior of the popularity distribution may be obtained by examining the linear (early-age) 
growth of the inverse transform of Eq. (lA.llI) . The resulting popularity distributions qn{a) show 
(for large n) a regime of linear-in-age growth, and in the case where 7 ~ 2 , the rate of this growth 
depends only weakly on n. Since the mean popularity m(a) is also growing linearly during this 
age period (see Eq. (1431) 1. the division of the CCDFs at various ages by the corresponding mean 
m{a) leads to the collapse of the data onto the single curve that is seen in Fig. [6f3- 


6.4 Heterogeneous activity rates 

Although our analysis methods are quite general, in order to focus on understanding the combined 
effects of memory and out-degree distribution most of our results thus far are specialized to the 
case of uniform user activity rates, Pjk = 1. It is interesting, therefore, to examine the impact 
that more realistic heterogeneous activity rates would have upon the results we have obtained. 
To this end, we extend here to the case where the activity rate of a user depends on its out-degree 
k while retaining the assumption pjk = 5j,zPk-, so that = h (normalized so that Yhk PkPk = 1 

and with /3 = Ylk ^hPk)- 
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The mean popularity is given in the general case by Eq. (l36t) . Repeating the asymptotic 
analysis of leading to Eq. (143 p for the /r —>• 0 limit, we again find linear growth of m{a) with age 
a, with a slope that generalizes that found in Eq. ()43p : 


m{a) ~ 


TXz'^ + 1 


as a —)■ oo, 


(58) 


where we have introduced the notation = Ylk I Vk- 

If we additionally assume that the user activity rates saturate to a constant level /3oo at very 
large A;, so f3k —)• /3oo as k ^ oo, then we can repeat the asymptotic approximations of Sec. 14.51 to 
determine a generalized version of Eq. (jA.lip : 


H{s] x) ~ 


1 

s 


1 

(s + Xzj]) 


Xz/3{s + XzP + ) (7 - 1)(1 - x)$(s) 


7-2 » 


s + XzP — Xz(3^{s) + fiSo (7 — 1)AL)t'-i [r(l — 7 )]r-i (1 — x)t'-i4)(s) 


(59) 


To demonstrate the effect of heterogeneous activity rates, we consider a model for j5k inspired 
by the data analysis shown in Fig. 6(a) of |38j . see Appendix O for details. Using this hetero¬ 
geneous activity rate. Fig. [7] shows results that correspond closely to the homogeneous-activity 
example of Fig. [6l A comparison of panels D from both figures clearly shows that including 
heterogeneous activity rates leads to a better fit of the model to the data on the fraction gi(a) 
of non-retweeted memes. However, the other results of the model (panels A, B and C of Fig. [7] 
compared to same panels in Fig. [6P are relatively unaffected by the activity rate, so that the good 
matches between model and data seen in Fig. [6] are not compromised by including heterogeneity 
in activity rates. 


7 Limitations of the model 

As we have demonstrated, the analytical tractability of the null model enables it to be fitted 
to time-dependent data on meme popularity. However, we were required to make a number of 
assumptions to obtain analytical results and in this section we briefly highlight the most important 
assumptions and discuss possible extensions to the model. 

The network structure is assumed to be that of a directed configuration-model graph defined 
by the joint probability pjk of a node having in-degree j and out-degree (number of followers) 
k. While this joint probability can encode correlations between the number followed by, and the 
number of followers of, a node, it does not incorporate edge-based correlations, i.e., the probability 
that a user with many followers is followed by users who also have high numbers of followers. It 
may be possible to extend the analysis of the model to deal with at least some types of edge 
correlation [67l[68], but this would be at the cost of increased complexity of the equations. 

A more unrealistic simplification of the configuration model is the fact that it generates net¬ 
works that are locally tree-like, with few short cycles. In particular, our model does not include 
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Figure 7: As Fig. [6l but including heterogeneous activity rates /3fc given by Eq. (jC.l|) . Model 
parameters are: A = 5 x 10“^, fi = 0.033, kc = 0.25, 6 = 500, with one model time unit 
corresponding to 0.18 days. 
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bidirectional edges (i.e., reciprocated following relationships, where nser A follows user B and B 
also follows A), which are quite common in the Twitter network [69], but which violate the inde¬ 
pendence assumption of a branching process. However, numerical simulations in Ref. m using 
a real Twitter network for a zero-memory version of the model (Sec. S4 of [37]) gave quite good 
agreement with branching process theory, despite the presence of a large fraction of reciprocal 
links in the graph. The conditions under which tree-based theories give good approximations 
for dynamics on non-tree-like networks remains an active area of research m and more work is 
required for further understanding. 

An important assumption of the null model is that all memes have equal fitness. This is 
consistent with random-copying models of human decision-making [361171] where the quality of 
the product—here, the “interestingness” of the meme—is less important than the social influence 
of peers’ decisions m- This neutrality of the model is at the root of the criticality of the dynamical 
system [39]. A related (discrete-time) model for the number of citations gained by scientific papers 
was analyzed in Ref. m, where the authors also extended their neutral model to include unequal 
fitnesses of papers. R is likely that our model could be extended in a similar way, to incorporate 
a fitness parameter for each individual meme. Based on the results of Ref. [19] . we expect that 
our main results would be qualitatively unaffected if the distribution of fitness values over the set 
of all memes is strongly peaked (i.e., if most memes have roughly equal htness values, with only 
the high-fitness outliers demonstrating supercritical popularity growth). 

Perhaps the most unrealistic aspect of the current model is the assumption that all users 
have constant activity rates, so their tweeting activity is described by a Poisson process (see the 
discussion in Sec. I3.2|l . It would be interesting to relax this assumption, for example to allow 
the activity of users to be described by models such as that of Ref. m or by inhomogeneous 
Poisson processes: the latter incorporates time-varying activity rates and so could model the 24- 
hour variability in tweeting levels determined by daily patterns m- However, we believe that 
the near-critical aspect of the model will not be strongly affected by such generalizations. To see 
this, consider the heuristic derivation of the branching number ^ that was described at the end 
of Sec. 14.11 Over a sufficiently long time window W, the expected number of interesting memes 
received into the stream of a (j, A:)-class user is linear in the number j of users followed, and this 
remains true even for inhomogeneous Poisson (or even non-Poisson) activities, provided that the 
observation window is long enough (e.g., such that the average rate j3 of incoming tweets should 
yield approximately similar values when time-averaged over disjoint time windows of length W). 
Similarly, the expected number of retweets by the user during the time window can be written as 
in Eq. (1231) . but with the Poisson rate replaced by its time-averaged value. The calculations of 
Eq. (|24|) then proceed as before, leading to the conclusion that the branching number limits to the 
critical value of one as /r —)• 0, which implies that non-Poisson user activity rates (or burstiness) 
will not affect the criticality of the model, which is a long-time (i.e., ages of memes limit to 
infinity) characteristic. Of course, the short-term behaviour of the model (such as the small-a 
behaviour in panels B-E of Eig. [4]) would be affected by introducing burstiness; incorporating 
such realistic features into the model is left as a challenge for further work. As a final comment 
on this topic, we note that the agreement (in Sec. [6]) of our theoretical results with real data of a 
spreading process for which users’ activity rates are not constant also provides indirect evidence 
that the phenomenology discussed is robust to the details of user activity burstiness. 

The heuristic calculation of the branching number considered at the end of Sec. l4.1l also offers 
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a clue as to how the model can be extended to the spreading of information on undirected social 
networks (as opposed to the directed networks that we focus on in this paper). Of particular 
interest is the spreading of app-adoption on Facebook, for which data was analyzed in Ref. [TO] 
and a computational model was introduced in Ref. [5]. If the Facebook update messages that 
inform all friends of user A that she has installed a particular app are considered to be the memes 
in a version of our model, then the arguments of Sec. l4.1l need only slight modifications. The total 
number of update messages received in the stream of a user with k Facebook friends is linear in k 
(i.e., the j in the denominator of Eq. (I22p is replaced by k), while the expected number of friends 
who would be interested in user ^’s adoption of the app is X{k — 1) (since one friend out of k 
must have adopted before A in order to have spread the message to her). Following very similar 
steps to calculate the expected number ^ of children of a meme—see the calculations leading to 
Eq. (pTj) — we find that 


> k l3kX{k — 1) 

sundirected ^ / Pk , -tt . P' ^ 0 

= 1 - 1 . ( 60 ) 

Although this branching number is less than one, the mean number z of friends on Eacebook is 
large (e.g.. Ref. m calculated z « 190) so that ^undirected is in fact very close to unity, implying 
that the information spread process is close to criticality. Such a near-critical branching process 
was hypothesized in Ref. [5| to explain to observed fat-tailed distributions of app popularity in 
Eacebook data and the temporal characteristics of the adoption behaviour. The cascade sizes 
for other forms of “meme” spreading on Eacebook have also been observed to have fat-tailed 
distributions [TT]. Other undirected networks to which the model should be applicable include 
YouTube [63] and Digg [2117^. 

Einally, our focus here has been on the statistical physics of the model, but for completeness 
we should note the difficulties inherent in applying the model to data sets where memes may 
not be as simple to recognize and track as hashtags are. In [21] for example, the process of 
extracting memes (representing popular scientific terms) from data (citation archives of scientific 
publications) is explained in detail, and considerable such effort will generally be required to 
identify and track the memes to which this null model might be applied. 

A related question is whether the popularity of online memes has any implications in terms 
of mass social movements in the offline world. This is a complex question that lies beyond the 
scope of this paper, but we note that Eig. 3 of m shows that the usage of hashtags related to the 
15M Spanish protest movement was found to be closely correlated with the number of protest- 
related headlines in newspapers, at least during the main activity of the protests. This indicates 
that online social spreading phenomena can, at least in some cases, give useful information about 
real-world social movements and activism. 

8 Discussion 

The extremely wide range of popularities achieved by items on social media poses many challenges 
for complex systems researchers. These include the identification of the causes m and structural 
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features m of “viral” propagation, and the prediction of future spreading based on the content 
or the early-time growth of memes [4l ll61l791l8nj . each of which are important in the design of more 
efficient systems to spread information (e.g. in case of emergency). We argue that null models are 
fundamentally important in this quest—and complement more data-driven approaches—as they 
demonstrate, for example, that extreme popularity can arise purely because of random fluctuations 
in the competition between memes for user attention. While the content of a meme may well 
be an important factor in its popularity (or predictability [79]), definitive statements about the 
significance of such factors should be referenced to an appropriate null model. 

In this paper we have introduced and analyzed a null model of meme spreading that is an¬ 
alytically tractable, yet realistic enough to reproduce several characteristic features of empirical 
data. The model is sufficiently general to incorporate heterogeneous user activity rates and a 
joint distribution pj^ of the number of users followed j and the number of followers k, as well 
as a memory-time distribution $ that gives non-Markovian dynamics. The competition-induced 
criticality phenomenon identified in a zero-memory model in Ref. m is found to be robust to 
the generalizations, giving power-law popularity distributions with characteristic time-dependence 
similar to data from social spreading phenomena (and see Sec.[7]for a discussion of further possible 
extensions of the model). 

The analytical tractability enables fast fitting of the model to data, as demonstrated in Sec. [6] 
with hashtag data from Twitter. We hnd that a simplihed version of the model where users all 
have the same activity rate can be fitted to some, but not all, aspects of the data (see Fig. [6|). 
The aim of a null model is not to perfectly reproduce every aspect of a dataset, but rather to 
help identify which features of the data can be reproduced using relatively simple models, and 
so to highlight aspects where more detailed modelling (or, perhaps, factors entirely outside the 
model) are required to match to data. In this respect, the null model highlights the fact that 
heterogeneity in activity rates is vital to accurately capturing the qi{a) curve (compare Figs. [6p 
and ED), even though the time-dependence of the bulk of the popularity distribution may be 
described reasonably well by a model with homogeneous activity rates (Fig. [6l4-C). 

As noted in the Introduction, and expanded upon in Sec. [T] our definition of “memes” is suffi¬ 
ciently general to enable the model to be applied (with minor changes) not just to the spreading 
of hashtags or URLs on Twitter, but also to the adoption of apps on Facebook, the popularity 
of videos on YouTube, and to the broad range of imitation-driven spreading dynamics. We an¬ 
ticipate that the analytical results and potential for fast fitting to data will make this null model 
a useful tool for further work, and we hope it will contribute to the ongoing investigation of the 
entangled effects of memory, network structure, and competition on social spreading phenomena. 
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A Calculation of the large-a, large-n asymptotics of popularity 
distribution 


In this Appendix we consider how the popnlarity distribution qn{a) behaves for large, but hnite, 
ages. To highlight the effect of the out-degree distribution npon the results we here restrict 
our analysis to the case = 1, pjfc = 6j^zPk- Taking the // —)• 0 limit, Eq. (fT5]) becomes 


COO 

G{a]x) = 'y2Pk / diXze~^^^x 
u Jo 

r*min(£,a) pa—r 


k 

X exp 


pmm[t,a) pa—r 

/ df df <I>(o — f — f)(l — X [1 — A-|-AG(f; x)]^) 

Jo Jo 


(A.l) 


Writing x = 1 — w and G(a;x) = 1 — we observe that the argnment of the exponential 

fnnction vanishes when w = 0 and 4> = 0, and so we consider the small-re, small-(/> asymptotic 
behavior by expanding the exponential term to hrst order in its argument: 


poo p 

4>{a-,w) « / diXze~^^^ 

Jo Jo 


mm{i,a) 


df 


df <I>(a — f — f)(l — (1 — re) ^^Pk [1 — A^(f; w)]^) 

k 


(A.2) 

We note that retaining only the hrst-order term in the expansion of the exponential is an approxi¬ 
mation. We will estimate the accuracy of this “one-term expansion” by comparing the inhnite-age 
limit determined nnder the approximation with the corresponding exact valnes as given in Sec. 14.41 
For the case of a scale-free ont-degree distribution with ~ Dk~^ as A: —)• oo, and 7 in the 
range 2 < 7 < 3, the asymptotic form of the snmmation term in Eq. (IA.2P is given by [37] 


1 — (1 — tc) '^^Pk [1 ~ ~ ^^4 — C4'^ ^ tc -|- o{'w, 4) as tc —)• 0, (/) —)• 0, (A.3) 

k 


with the constant G given by G = A'^“^T)r(l — 7 ). Applying the integral-swapping trick of 
Eq. (|34p allows the right hand side of Eq. (IA.2p to be expressed as a donble convolntion integral. 
Laplace transforming then yields 

4{s-, w) = —4>(s)£ [Xz4 — C4'^~^ + w\ , (A.4) 

Xz s 


where C denotes the Laplace transform operation applied to the term in square brackets. In the 
a —>■ 00 limit, this equation is satisfied by the steady-state solution 


4{pO]w) = C 


(A.5) 
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Figure 8 : The function F{C, 7 ), as defined in Eq. (IA. 6 h . for values of 7 close to 2. The highlighted 
points are the parameter values that are relevant to Fig. [3] {Xz = 11, 7 = 2.5) and to Fig. [U] 
{Xz = 0.32, 7 = 2.13). In all cases of interest the values of F are close to 1, and so we conclude 
that the one-term expansion used in Eq. ()A.2ji gives a good estimate of the exact steady-state 
solution. 


as can be verihed using the final value theorem for Laplace transforms. We note that the cor¬ 
responding expression for w) as calculated from the steady state Eq. (HOl) has a additional 
multiplicative factor of F{Xz,'y) that is absent in Eq. (IA.5I) . where the function ^(^, 7 ) is defined 
by 





rF ^ 

(C + 1)"+^ 


1 

7-1 


(A.6) 


see Fig. [H If Az 7> 1, then F{Xz,'y) ^ 1 and the one-term expansion gives results that are very 
close to the exact values (at least in the infinite-age limit a —)• 00 ). Moreover, even if Az is not 
large (e.g., Xz = 0.32 for the model fit to Twitter hashtags data in Sec. [ 6 |), the values of F{Xz,'y) 
can still be close to unity if 7 is sufficiently close to 2 . 

To consider small deviations from the steady state, we define g{a] w) by 


cj){a] w) = 4>{oo] w){l- g{a] w)) 


(A.7) 


with g{a; w) —>■ 0 as a —)• 00 . Assuming that g is sufficiently small to allow the use of the linearizing 
approximation 

(1 - 5)'^"^ ~ 1 - (7 - 1)5, (A.8) 

Eq. (|A.4I) can be solved for the Laplace transform of g: 


1 s -|- A^; — A2:4 >(s) 

g[s-,w) = - -:-— 

* s -|- Az — Az$(s) -l- (7 — FjC 7-1 tcT-i <h(s) 


(A.9) 


The Laplace transform of (j) then follows from Eq. (lA.Tji and a similar asymptotic analysis of 
Eq. (fT71) yields 


H{s; I — rc) 
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Substituting from Eqs. ()A.7I) and (IA.9I) results in 


H{s] x) = 


1 


1 - 


A 2 : + Az + (7 — 1)(1 — x)l>(s) 


7-2 


{s + Xz) ( s + A2: — A2:$(s) + (7 — l)AiAT'-i [r(l — 7 )] T"-! (1 — $(s 


(A.ll) 

A similar analysis can be performed in the case where the out-degree distribution pk has finite 
second moment. We again utilize a one-term expansion similar to Eq. (IA.2h . but we can also 
retain a non-vanishing innovation probability p in this case. The one-term expansion can be 
shown to be accurate when Az S> 1 ; this condition is obeyed in all relevant cases we examine. 
The resnlting large-a asymptotics for the generating function H{a]x) are found by inverting the 
following Laplace transform: 


His-,1 — w) = - 4>{oo] w)x 

s 

(1 - fi)Xz{s + Xz + p + ^>('S)) 
s{s + Xz + p) 


_2(1 - _' 

s + Xz + p - (Az(l + p) + 2/i) l>(s) -I- 2(1 - p) 

(A.12) 


with <^( 00 ; w) given by 

. -p{Xz -h 1) -h pfl{Xz + 1)^ 2A2(1 - //)2 ((/c2) - z)w 

=-wwIhMvt- 


(A.13) 


B Numerical inversion of Laplace transforms and PGFs 


Many of our results for the popularity distribution qn{o) are expressed in terms of the corre¬ 
sponding PGF H{a;x). As in [37], we use the Fast Fourier Transform method of |461I811[83] to 
numerically invert the PGF at a fixed age a to produce, for example, the model distributions 
in Figs. [ 6 ] and [71 see Sec. S2 of m for further details and links to Octave/Matlab code for 
implementing the PGF inversion. 

The results of the model for the age-dependence of several quantities are expressed in terms 
of Laplace transforms. To numerically invert the Laplace transforms we use the efficient Talbot 
algorithm [M], in its simplified version described in Sec. 6 of [85] . The Talbot algorithm is based 
on a numerical evaluation of the Bromwich (Laplace inversion) integral, using a cleverly-chosen 
deformation of the contour in the complex-s plane. The Laplace inversion of H(s;x) to obtain 
H{a] x) at a desired age a, for example, can be quickly computed using the 2Ml — 1 weights 7 ^ 
and nodes 6k dehned by [ 86 ] 
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(where i = V—1) by calculating the sum 
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In practice, the precision of the Talbot algorithm is very high, and only relatively small values of 
Ml are required to obtain accurate results; we used Mi = 25 in the examples shown. 


C Model of heterogeneous activity rates 

In the data analysis of Fig. 6(a) of [38], the average activity rate (as measured by the number of 
tweets by a user in a fixed time period) is found to grow approximately linearly with the number 
of followers k of that user, for k from 0 to about 100. Then, for k values from about 100 up to the 
maximum shown in the plot {k = 10^), the activity rate grows as a more slowly increasing linear 
function of k. We model these characteristics (which are also seen in other studies, e.g., [87jl. 
using a piecewise-linear and continuous function of k, assuming a saturation of activity at very 
high k, as follows: 

( 0.35k if fe < 100, 

/3k (X < 35 + 0.044(A: - 100) if 100 < A: < 10^. (C.l) 

[ 470.6 if A: > 10"^, 

where the values are chosen to closely match the linear growth rates in Fig. 6(a) of [38|, snd the 
constant of proportionality being set by the condition = 1- 
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